A Hidden Markov Technique for Haplotype Reconstruction
نویسندگان
چکیده
We give a new algorithm for the genotype phasing problem. Our solution is based on a hidden Markov model for haplotypes. The model has a uniform structure, unlike most solutions proposed so far that model recombinations using haplotype blocks. In our model, the haplotypes can be seen as a result of iterated recombinations applied on a few founder haplotypes. We find maximum likelihood model of this type by using the EM algorithm. We show how to solve the subtleties of the EM algorithm that arise when genotypes are generated using a haplotype model. We compare our method to the well-known currently available algorithms (phase, hap, gerbil) using some standard and new datasets. Our algorithm is relatively fast and gives results that are always best or second best among the methods compared.
منابع مشابه
Unsupervised Haplotype Reconstruction and LD Blocks Discovery in a Hidden Markov Framework
In the last years haplotype reconstruction and haplotype blocks discovery, i.e., the estimation of patterns of linkage disequilibrium (LD) in the haplotypes, riveted the attention of the computer scientists due to the involved strong computational aspects. Such tasks are usually faced separately; recently, statistical generative techniques permitted to solve them jointly. Following this trend, ...
متن کاملModified Internal Fixation Technique for Acromio-Clavicular (AC) joint dislocation: The “Hidden Knot Technique”
Acromioclavicular (AC) joint injuries are common and often seen in contact athletes, resulting from a fall on the shouldertip with adducted arm. This joint is stabilized by both static and dynamic structures including the coracoclavicular (CC)ligament. Most reconstruction techniques focus on CC ligament augmentation as the primary stabilizer of the AC joint.The best surgical technique for some ...
متن کاملJoint haplotype phasing and genotype calling of multiple individuals using haplotype informative reads
MOTIVATION Hidden Markov model, based on Li and Stephens model that takes into account chromosome sharing of multiple individuals, results in mainstream haplotype phasing algorithms for genotyping arrays and next-generation sequencing (NGS) data. However, existing methods based on this model assume that the allele count data are independently observed at individual sites and do not consider hap...
متن کاملTaylor Expansion for the Entropy Rate of Hidden Markov Chains
We study the entropy rate of a hidden Markov process, defined by observing the output of a symmetric channel whose input is a first order Markov process. Although this definition is very simple, obtaining the exact amount of entropy rate in calculation is an open problem. We introduce some probability matrices based on Markov chain's and channel's parameters. Then, we try to obtain an estimate ...
متن کاملIntroducing Busy Customer Portfolio Using Hidden Markov Model
Due to the effective role of Markov models in customer relationship management (CRM), there is a lack of comprehensive literature review which contains all related literatures. In this paper the focus is on academic databases to find all the articles that had been published in 2011 and earlier. One hundred articles were identified and reviewed to find direct relevance for applying Markov models...
متن کامل